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Abstract 

For any given e > 0 we provide an algorithm for the Qnadratic Knapsack 
Problem that has an approximation ratio within and a rnn time 

within 
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1. Introduction 

The qnadratic knapsack problem (QKP) can be defined in graph theoretic 
terms as follows. Let G be a graph with vertices Vi,i = with costs 

Cj > 0 associated with the vertices and profits Pij > 0 associated with each 
edge ij and each vertex i (corresponding to pu). Given a cost limit c the 
problem is to find a collection of vertices with total cost at most c with 
the total proht (the snm of all profits of vertices and edges in the indnced 
snbgraph) as large as possible. A particnlar form of this problem where the 
costs and profits are all 1 is eqnivalent to the densest k-snbgraph problem 
(DkS). Tims DkS asks for the selection of k vertices so that the indnced 
snbgraph on those vertices has a maximnm nnmber of edges. The reader is 
referred to [5] for a snrvey of the QKP problem, and the recent article [6] on 
approximating QKP for special graph classes. In particnlar we note that [6] 
has no resnlts on approximating QKP in the general case, and the anthor 
is not aware of any snch previonsly pnblished resnlts. Note that the related 
problem where negative prohts are allowed is known to be inapproximable 

d- 

2. The Theorem 

Theorem. Let M be any approximation algorithm for DkS that has an 
approximation ratio within < a < 1, e > 0 and rnns within time 
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Then M can be used to construct an approximation algorithm M' 

2q: _i_q^ k 

for QKP that has an approximation ratio within ) and runs within 

time 

Since [1] provides, for any e > 0, an approximation algorithm for DkS 
with ratio 0(n4+^) and runs within time the following corollary fol¬ 

lows from the theorem immediately (given e select M to have approximation 
ratio 0(n4+^/^) and run time 0{n^/^)). 

Corollary. There is an algorithm for the Quadratic Knapsack Problem that 
has, for any e > 0, an approximation ratio within and a run time 

within 

Proof of the theorem. Let G be a graph with cost limit c, and M 
an approximation algorithm for DkS that has an approximation ratio of 
^(^a+e), 0 < Q- < e > 0 and runs in time 

2.1. Overview 

We sketch the main flow of the proof method. We group the costs and 
prohts of the original instance into 0{logn) many buckets. This allows us 
to split the original instance into 0{{logn)^) sub-instances each with a sim- 
plihed structure. One of these sub-instances has costs and prohts on the 
vertices only, while each of the others have closely bounded costs, no vertex 
costs and the same edge prohts. In this way we group these sub-instances 
into hve classes: the hrst is an instance of the classical knapsack problem; 
while for the other four classes the QKP can be approximated provided we 
have an approximation for the DkS problem. Finally by a simple averaging 
argument at least one of the sub-instances must have a proht within a factor 
of 0{{lognY) of the original instance. See also [2] where this argument is 
used to approximate the weighted DkS problem from approximations of the 
DkS problem. This paper extends the method further to the QKP. 

2.2. Pruning, Rounding and Grouping 

We may prune all vertices with Cj > c and all edges ij where Cj + Cj > c 
since these cannot be part of a feasible solution. Further if Cj = 0 then the 
vertex can be pruned and the attached edge values added to the neighbouring 
vertices without affecting the problem. Also if pij = 0 then the edge ij can 
be pruned. 

Notation: Let MAX[G,c\ be the maximum total proht of the solution to 
QKP applied to the Graph G with cost limit c. Let G^^^[c\ be a subgraph 


2 


of G corresponding to a maximum solution (so the vertex costs of 
are at most c, and the prohts MAX[G,c\). Let any subgraph in which the 
total cost of the vertices is at most c be termed feasible. Let \E{G)\ denote 
the number of edges in any graph G. 

Now let p* be the largest Pij and 2^ the largest power of 2 at most p*. Form 
the graph G' from G with each pij rounded down to the nearest proht among 
{ 2 *, 2 ^“^, 2 ^“'^, 0} where q is the smallest integer above log 2 n^ = 2 log 2 n. 
Then MAX[G\ c] > 1/4:MAX[G, c] since the effect of deleting all edges with 
prohts less than 2 ^“'^ and rounding down the remaining prohts each reduce 
the maximum total proht by a factor of at most 1/2. Similarly let c* be the 
largest Cj and 2^ the smallest power of 2 at least c*. Now group the vertices 
into buckets 14 , i = 1 ,Z + 1 where I is the smallest integer above log 2 n and 
Vi,i ^ I is the collection of vertices Vj where 2 ^“* < Cj < and 14 +i is 

the collection of vertices Vj where 0 < Cj < 2^~K 

2.3. Five subgraph classes 

The edge and vertex prohts of G' can be shared among at most 2{log2n + 
1)^ + 1 subgraphs within hve classes, with Classes 2-5 constructed based 
on the cost groupings and proht roundings. We shall show how to obtain 
( 9 ( 77 , 1 +^+ 2 ':) Qj. better approximations for each of these classes in time 
or less. 

Class 1: The subgraph of G' obtained by removing all of the edges. 

Class 2: Subgraphs with costs within I 4+1 and edge prohts some power of 2, 
and no vertex prohts. 

Class 3: Subgraphs with costs within 14,'Z < Z and edge prohts some power 
of 2 , and no vertex prohts. 

Class 4: Bipartite subgraphs with one part having costs within I 4 + 1 , the 
other with costs within 14,* < Z, and edge prohts some power of 2, and no 
vertex prohts. 

Class 5: Bipartite subgraphs with one part having costs within 14,* < Z, the 
other with costs within l^',* < j < Z, and edge prohts some power of 2, and 
no vertex prohts. 

Class 1 corresponds to the Knapsack problem and so has an algorithm 
that has an approximation ratio within 1 -|- e and a run time within OirA je) 
0. 111. Since c > c* then 2^ ^ < c/n so that Class 2 subgraphs have the 
sum of all vertex costs at most c. Thus the maximum edge proht is found 
by summing all the edge prohts. 

Class 3: Let H be any subgraph in this class. Let the equal edge prohts 
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be 2^. We can simplify by scaling the costs and profits so that the vertex 
costs are between 1 and 2, and the edge prohts are all 1. Specihcally form 
a modihed subgraph H by dividing the (vertex) costs by 2^+*“*, and setting 
the edge prohts to 1. Set the cost limit c for H to c = . Then 

MAX\H,c\ = ^^MAX[E,~d,. ( 1 ) 


Thus approximating the QKP for H, c is equivalent to approximating the 
QKP for H, c. 

We now analyse the graph H with cost limit c. Observe that the number 
of vertices r in any maximum solution must be between c/2 and c. So let 
H^^^[c] have r vertices. We may assume r > 4 or else the problem is trivial. 
Let = [c/2j be any subgraph of H with a maximum number of edges 
formed by choosing t vertices. In particular we can choose t vertices from 
ways and the collection of these choices counts each edge 
of It follows that some such subgraph with t vertices 

must have a number of edges at least 





r(r — 1) 10 


( 2 ) 


the minimum corresponding to r = 5,f = 2. Thus must have at least as 
many edges as this subgraph and is also feasible. H* can be approximated 
by algorithm M applied to the DkS problem for H, t and this provides an 
^(^o+e) approximation algorithm that runs in time We note that 

a < 2q;/(1 + a). This completes this class. 

In the following for a bipartite graph H[A,B] with parts A and B, if 
A' C A, B' C B then H[A', B'] refers to the subgraph of H[A, B] induced by 
A' and B'. 

Class 4: Let H[A, B] be any subgraph in this class with parts A and B, with 
the vertex costs of A between 0 and 2^“*, and the vertex costs of B between d 
and 2d, d a power of 2. Since the edge prohts are all equal we can scale them 
as in Class 3, so that they are all 1 in the modihed graph H[A, B] with cost 
limit c. We may assume c> Ad otherwise the problem is essentially hnite by 
considering all choices of at most 4 of the vertices of B. For each such choice 
the choice of corresponding vertices from A amounts to an instance of the 
Knapsack problem. Similarly we assume 1^41 >4. 
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Select B' to be the c/{Ad) highest degree vertices of B in H[A,B]. Then 
\E{H[A,B'])\ is at least {1 /A)MAX[H,c\ since any feasible solution has at 
most c/d vertices from B. Now select A' to be the \A\/A highest degree ver¬ 
tices of A in fflA,B']. Then \E{H[A\&])\ > {1/A)\E{H[A, B'])\. Thus 
\E{H[A\B'])\ > (l/16)MAX[if, c], and since 2^“^ < cjn the subgraph 
\E{H[A', B'])\ is also feasible as the total cost of vertices in each part is 
at most c/2. This completes this class. 

Class 5: Let H[A, B] be any subgraph in this class with parts A and B. 
As in Classes 3 and 4 we modify H[A,B] to H[A,B] and the cost limit c 
to c, where in H[A,B] the edge profits are 1, the vertices in A have costs 
between 1 and 2, and the vertices of B have costs between d and 2d, where 
d is a positive power of 2 as in hgure 1. We may also assume c> Ad (see the 
argument in Class 4). We consider two cases. 

Case 1: \A\ < c(i+")/(i-«); Select the c/(4d) highest degree vertices & of B. 
Then \E{H[A, B'])\ > l/AMAX[H,c] since any feasible solution can have 
at most c/d vertices of B. Now select the c/4 highest degree vertices A' of 
A in H[A,B']. Then H[A',B'] must have at least c/(4|A|) of the edges of 
H[A, 13']. So H[A', B'] is feasible and since n > \A\ 

\E(HIA',B'])\ > AT|B(ff|i,B'])| 

4 |/ 1 | 

> X'SiMAXyH.c], (3) 

16 

Case 2: \A\ > This case is the most challenging and we provide a 

graph transformation method to deal with it. Construct a graph H* from H 
by replacing Bhj d copies Bi,B^ of the vertices of B with edges between A 
and each Bi copying the edges between A and B. Thus if A = {oj, i = f, r}, 
B = {bi, i = I ,.., s} and Bj = {bij, i = I, ■■, s,j = 1,d} then 

(oj, bj) is an edge of H iff (a^, bjk), /c = 1,.., d is an edge of H*. (4) 

In H* set cost(6jj) = (l/d)cost(6i) and so each cost{bij) is between 1 and 2 (see 
hgure 1). Now MAX[H*,c] > dMAX[H,c] since any feasible subgraph of 
H can be replicated d times to a feasible solution of H*. Use algorithm M to 
approximate the DkS problem for H* with k = cto within an approximation 
ratio of say s. Let H**[A', B'^, B' 2 ,be an induced subgraph correspond¬ 
ing to this approximation. Dehne Sij be the degree of bij in H** (if bij is not 
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Figure 1: bipartite graphs H and H* 


in H** then 6ij = 0), and S* = maxj{6ij}. Now choose a subgraph of H as 
follows. Select B' to be those bi for which 6* is among the c/ {4d) highest 
values among Then since Ui?' contains at most c vertices 

|B(«KB'|)| > ^\E(H"IA',B[,B’„..,B2)\ > ^^MAXIH’,S]. (5) 

Now select N' to be the c/4 highest degrees of N in H[A',B']. Since \A'\ is 
at most c, \E{H[A\B'])\ > {l/A)\E{H[A\&])\ and so 

\E{H[A\B’])\ > ^MAX[H\b\ > ^MAX[H,c]. (6) 


H[A",B'] is feasible since the total cost of vertices in each part is at most 
c/2. By the inequality dehning this case 


c < |y4| i+“ < ni+° 

so El* has at most nd vertices where 


Thus 


1 _ 1 1 I l-a 1 2 

nd < -nc < -n i+“ = -ni+“. 
“44 4 




(7) 

( 8 ) 
(9) 
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Similarly the time taken for M to run is 

= 0(n'=(!+“)). (10) 

Noting that 2e/(H-Q;) < 2e and 2/(e(l + Q;)) <2le the selection oiRlA'^B'] 
through the use of M provides an approximation ratio of at most 
to MAX[H^ c] by inequality 6 , and this is obtained within time This 

completes this case and also this class. 

To complete the proof of the theorem we note that [c] must share a 

vertex/edge proht of at least l/[ 2 (/o 5 f 2 ^^ + l)^ + l] of MAX[G', c] with at least 
one of the subgraphs H in classes 1-5. Thus the largest proht found among 
the approximations to MAX[H,c\ over the subgraphs must approximate 
MAX[G', c], and so MAX[G,c], to within 

0{{log2n)^n^+°‘^ ) = 0{n^+°‘ ). (11) 

For n sufficiently large the term 3log{log2n)/logn is less than e and the desired 
approximation bound follows. The order of the run time can similarly be 
shown to be at most 0 (n^/^). 
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